Improved scale-cepstral analysis in speech
نویسندگان
چکیده
In this paper! we present improvements over the original scalecepstrum proposed in [ 11. The scalecepstrum was motivated by a desire to normalize the first-order effects of differences in vocal-tract lengths for a given vowel. Our subsequent work [2] has shown that a more appropriate frequency-warping than the log-warping used in [l] is necessary to account for the frequency dependency of the scalefactor. Using this more appropriate frequency-warping and a modified method of computing the scalecepstrum we have obtained improved feat,ures that provide better separability between vowels than before, and are also robust to noise.
منابع مشابه
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملPerceptual Significance of Cepstral Distortion Measures in Digital Speech Processing
Currently, one of the most widely used distance measures in speech and speaker recognition is the Euclidean distance between mel frequency cepstral coefficients (MFCC). MFCCs are based on filter bank algorithm whose filters are equally spaced on a perceptually motivated mel frequency scale. The value of mel cepstral vector, as well as the properties of the corresponding cepstral distance, are d...
متن کاملPerformance Evaluation of Blind Equalization for Mel-LPC based Speech Recognition under Different Noisy Conditions
This study is aimed to develop a noise robust distributed speech recognizer (DSR) for real-world applications by employing Blind Equalization (BEQ) for robust feature extraction. The main focus of the work is to cope with different noisy environments in recognition phase. To realize this objective, Mel-LP based speech analysis has been used in speech coding on the linear frequency scale by appl...
متن کاملCepstral-time matrices and LDA for improved connected digit and sub-word recognition accuracy
Previous work has shown that good accuracy improvements can be made for isolated word recognition using cepstral-time matrices as the speech feature instead of the more conventional MFCC-based speech feature augmented with higher order cep-strum. This work extends the performance improvements to UK English connected digit strings and to a sub-word based town names task. Experimental results are...
متن کاملFrequency-warping in speech
In this paper we present results that indicate that the formant frequencies between di erent speakers scale di erently at di erent frequencies. Based on our experiments on speech data, we then numerically compute a universal frequencywarping function, to make the scale-factor independent of frequency in the warped domain. The proposed warping function is found to be similar to the mel-scale, wh...
متن کاملHardware Implementation of Speech Recognition Using MFCC and Euclidean Distance
This paper suggests Digital Signal processor (DSP) based speech recognition system with improved performance in terms of recognition accuracies and computational cost. The comprehensive surrey of various approaches of feature extraction like Mel filter banks with Mel Frequency Cepstrum Coefficients (MFCC). This paper describes an approach of isolated speech recognition by Digital Signal Process...
متن کامل